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ABSTRACT 

Galaxy surveys that map multiple species of tracers of large-scale structure can im- 
prove the constraints on some cosmological parameters far beyond the limits imposed 
by a simplistic interpretation of cosmic variance. This enhancement derives from com- 
paring the relative clustering between different tracers of large-scale structure. We 
present a simple but fully generic expression for the Fisher information matrix of 
surveys with any (discrete) number of tracers, and show that the enhancement of 
the constraints on bias-sensitive parameters are a straightforward consequence of this 
multi-tracer Fisher matrix. In fact, the relative clustering amplitudes between trac- 
ers are eigenvectors of this multi-tracer Fisher matrix. The diagonalized multi-tracer 
Fisher matrix clearly shows that while the effective volume is bounded by the phys- 
ical volume of the survey, the relational information between species is unbounded. 
As an application, we study the expected enhancements in the constraints of realistic 
surveys that aim at mapping several different types of tracers of large-scale structure. 
The gain obtained by combining multiple tracers is highest at low redshifts, and in 
one particular scenario we analyzed, the enhancement can be as large as a factor of 
> 3 for the accuracy in the determination of the redshift distortion parameter, and a 
factor > 5 for the local non-Gaussianity parameter /nl- Radial and angular distance 
determinations from the baryonic features in the power spectrum may also benefit 
from the multi-tracer approach. 

Key words: cosmology: theory - large-scale structure of the Universe 



1 INTRODUCTION 

Recent advances in Cosmology, in particular those related 
to the present epoch of accelerated expansion, have been 
overwhelmingly driven b y astrophysical surv e ys dYork et al 



et al.l 120051; Abbott et al. 2005; Scoville et al 



ICole _. 

'Adelman-McCarthv et all l2008allbl; PAN-STARRSt 



BOSS ; Blake et "aHl201ll l. The legacies of these surveys are a 
set of increasingly tight constraints on cosmological param- 
eters and powerful tests of the robustness of the standard 
cosmological model, but also an alarming confirmation that 
there are gaping holes in that model. 

The landscape of available instruments and observa- 
tions is rapidly evolving. Besides general-purpose efforts that 
push the li mits of what was achieved by the Sloan Digital 
Sky S urvey ijYo rk et al. 2000) both in wavele ngth and in red- 
shift (|BigBOSSl ; ISUMIREt lEllis et alj|20l3 h now cosmolo- 
gists can also count on su rveys with a cadence aimed a t dis- 
covering variable objects (Rau 20091; lAbell et ai1l2009h . and 
mapping large volumes of the universe with high complete- 
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ness. The latter is achieve d by imaging large ar eas of the sky 
with narrow-band filters (|Bemtez et al.l 120091 ) or resorting 
to low-resolut ion (spect ral as well as spatial) integral-field 
spectroscopy (|Hil]|l200ct ). 

Galaxy redshift surveys are now expected to perform 
high-accuracy measurements of the equation of state of 
dark energy, test for models of mo dified gravity through 
lensing or redshift-space distortions fLinder 2005), impose 
constraints on the amplitude of non-Gaussian features i n 
the power spectrum (| Verde et al.ll200d ; lBa"rtolo et alj|2004lh 
measure neutrino masses, e tc. - see, e.g. (jBenltez et al .1120091 ; 
ISUM IRE; Ellis ct al. 2012). Since many surveys are already 
reaching large fractions of the sky with high completeness, 
the variance due to the survey's finite volume (i.e. cosmic 
variance) is perhaps the most formidable obstacle to further 
progress. 

Interestingly, it has be en pointed out bv lSeliakl |2009h : 
iMcDonald fc Seliakl (|200ctl that cosmic variance can be 
someho w circumvented for som e cosmological parameters - 
see also iGil-Marin et"aH (l20ld ); ICai fc Bernstein! (|2011al h 
lHamaus et al. I i|201ll . I2012F . This is true, m particular, 
for the redshift-space distortion (RSD) parameter, f3(z) — 
f(z)/b(z), and for the amplitude of local non-Gaussianities, 
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/jvl- The reason behind this "miracle" is that bias- sensitive 
parameters such as j3 and Jnl are not subject to the same 
random processes that lead to different realizations of the 
density field for some matter power spectrum P(k). There- 
fore, given a fixed density field, by comparing the clusterings 
between different types of tracers of large-scale structure 
(i.e. objects which correspond to halos of different masses) 
we should be able to measure these parameters with a preci- 
sion that is not limited by cosmic variance. We should point 
out that the source of this extra information is not some 
sub-Poissonian shot noise that can be achieved by, for exam- 
ple, mass-weighting instead of bias-weighting (|Seliak et alj 
12009] ; ICai et al.ll201lf ). but is rather a direct consequence of 
the different relative amplitudes of the clustering among the 
tracers. 

Using these ideas, lHamaus et al.l l|201ll . [2012), con- 
structed a covariance matrix for the ratios of the clusterings 
of any two types of tracers, and used N-body simulations 
to obt a in enh anced constraints for j3 and /jvl- Conversely, 
ISlosarl (|2009T ) constructed the Fisher information for /ml 
directly from the covariance matrix for co unts in cells un- 
der sim plifying assumptions similar to ours. iGil-Marm et alj 
(2010), on the other hand, studied how non-linear bias and 
bias stochasticity can degrade the two-tracer constraints, 
and found that the actual improvements in the constraints 
may be up to ~ 50% smaller, co mpared with the ideal case 
of linear and deterministic bias. ICai fc Bernstein] (|2011al ) 
pointed out that bias modulations between dark matter 
halos imply correlations between the biases of the tracers, 
which limits the potentia l gains of the multi-tracer approach. 
ICai fc Bernsteinf (|2011bh then analyzed how weak gravita- 
tional lensing tomography surveys can be combined with 
redshift surveys in order to mitigate those correlations, and 
to break the degeneracies between the biases and the other 
cosmological parameters of interest. 

In this paper we show that the enhanced constraints of 
bias-sensitive parameters in multi-tracer surveys are a di- 
rect consequence of the Fisher matrix for multiple tracers 
of large-scale structure. The analysis is an extensio n of the 
single-tracer Fisher matrix of iFeldman et all (| 19941 ) (hence- 
forth FKP) to multiple tracers of large-scale structure. What 
emerges from the multi-tracer Fisher matrix is, b esides the 
usua l cosmic variance-lim ited "effective volume" |Tegmarkl 
1 19971 ; I Tegmark et al.|[l998l ). simple expressions which quan- 
tify the amount of information that lies in the "relational" 
degrees of freedom, which are completely independent of the 
effective volume. 

We show that there is a simple choice of variables which 
diagonalizes the multi-tracer Fisher matrix. The total effec- 
tive volume of the survey then appears naturally as one of 
the terms of this diagonal matrix, while the remaining terms 
correspond to the information in the relative clusterings be- 
tween the different species of tracers. The key aspect is that 
the Fisher matrix elements corresponding to the relational 
degrees of freedom are unbounded, and can in fact carry 
much more information about some parameters than the ef- 
fective volume - which is, of course, bounded by the physical 
volume of the survey. 

This paper is organized as follows: in Sec. [2] we review 
the statistics of galaxy surveys, and present the generaliza- 
tion of the FKP method to multiple tracers; in Sec. [3] we 
show how a certain choice of relative clusterings between 



the tracers leads to the diagonalization of the multi-tracer 
Fisher matrix; in Sec. [4] we show an application to a hypo- 
thetical survey of three types of tracers, and how the rela- 
tional information improves the constraints; our conclusions 
comprise Sec. [5] 



2 THE STATISTICS OF GALAXY SURVEYS 

The main observables galaxy surveys try to measure are the 
redshift-space matter power spectrum and its sub-products, 
such as the baryon acoust i c osc il lations (lEisenstein et al.l 
1 19991 ; iBlake fc Glazebrookl |2003l ; ISeo fc Eisensteinl |2003h 
and redshift-space distortions (|Hamiltonl l2005al lbh. In 
terms of the matter density contrast S(x), the position- 
space power spectrum is given by the expectation value 
(5(k,z)S*{k',z)) = {2%fP(k,z)8 D (k - fc'), where S D is the 
Dirac delta function. 

The problem with this program is that we cannot di- 
rectly measure the fluctuations of the matter field in position 
space. Instead, what we observe are counts of tracers of the 
large-scale structure in redshift space. In the simplest ap- 
proximation, bias and redshift distortions can be regarded as 
operators acting on the density contrast, such that the fluc- 
tuation field of some tracer a (where a stands for galaxies 
of a certain type) in redshift space is related to the underly- 
ing mass fluctuation field in position space by some relation 
8 a (k,z) ~ [b a + ffil]8(k,z) . Here S a = (n a - n a )/n a is 
the fluctuation in the number density of the tracer species 
a over the average (n„) in redshift space, b a is the bias of 
that tracer, / = a! log G/d log a is the RSD parameter (G 
is the matter growth function, and a the scale factor), and 
Hk = k ■ r is the cosine of the angle of the Fourier mode with 
the line of sight. 

As a first approximation, we can assume the bias to be 
linea r, deterministic, and scale-independent (|Bardeen et al.l 
1986). However, it is known that structure formation 
leads to scale-dependent, non-linear and stochastic bias 
jBenson et alj|200d; iDekel fc Lahavl Il999l ; IWeinberd |2002| ; 
ISmith. Scoccimarro fc Shethll2009r i. Moreover, even the pri- 
mordial spectrum of cosmological fluctuations may ef- 
fectively introduc e a scale-dependent bias through non- 
Gaussian feature s (Bartolo et al. 2004; Sefus atti fc Komatsul 
l2007l ; |Palal" et al ]|2008l ). Hence, we can regard the bias as be- 
ing not only a function of redshift, but a function of scale, 
having at least some stochastic component. The RSD pa- 
rameter and the angular dependen ce can also inherit scale - 
dependent non-linear corrections |Raccanelli et all I2012T ). 
Since the power spectrum, bias, RSDs, and even the se- 
lection functions are estimated from the same data, it is 
important to understand how accurately one can measure 
these quantities and how their covariances carry over to the 
uncertainties in the parameters of interest. In the absence 
of data, the best tool for studying these issues is the Fisher 
information matrix. 



2.1 The Fisher matrix 

The main sources of uncertainty in galaxy surveys are cos- 
mic variance, which is due to the finite volume of the sur- 
vey, and shot noise, which arises from the statistics of the 
counts of tracers. In the case of surveys with a single species 
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of tracer, the optimal weighting function for galaxy counts, 
which minimizes the joint contributions of cosmic variance 
and shot noise to the variance of the power s pectru m es- 
timates, was first obtained by iFeldman et all |l994). The 
corresponding F i sher informatio n mat rix was derived by 
iTegmarkl (|l997l l; iTegmark et al.l (1 19981 ). who also showed 
that this Fis her matrix follows fro m the covariance of counts 
of galaxies (|Tegmark et al.lll998T ). The FKP Fisher matrix 
for the survey of the tracer species a can be written as: 



Fql; ij — 



d 3 kd 3 x dlogP a 1 / P, 



(2tt) 3 



l+Pc 



d log V a 

ddi 



where we define: 



V a (k-x) := n a (x)P a (k;x) 



(1) 



(2) 



The dimensionless quantity P a , which we will call the effec- 
tive power, expresses the power spectrum of the tracer a in 
units of its shot noise (which we take to follow the standard 
Poisson distribution, l/n a ). Under the usual assumptions 
about bias and the nature of RSDs, the effective power can 
be expressed as: 

P a (k; x ) ->• n a (x) [b a + f(z)n 2 k ] 2 P(k,z) . (3) 

Equation JT]) shows that we can regard the quantity: 

2 



F a (k;x) = - 



Va 



2U + ?„ 



(4) 



as the Fisher information density in phase space. In fact, 
the F a (k; x) defined above is the Fisher information matrix 
for the parameter \ogP a (k;x), and Eq. Q shows that the 
Fisher information density per unit of phase space volume 
is limited by F a < |. Therefore, for a survey of a single 
species of tracer there is an absolute limit on the amount 
of information that can be extracted from a finite volume 
and a finite range of scales. That upper limit is precisely 
| for each volume element of phase space d 3 k d 3 x/(2ir) 3 . 
Cosmic variance is just the posit ion- space manifestation of 
this upper limit. 

The Fisher matrix of Eq. (0 can be used to estimate 
all sorts of parameters, including the underlying matter 
power spectrum. For estimations of the power spectrum, 
we typically assume that the spectrum of the tracer is 
proportional to that of matter, and take the parameters 
Q % — s> p % = log P(ki), where the ki are bins in Fourier space. 
In that case, it is trivial to see that Eq. reduces to: 



Fc: 



1 4-KktAh 

2 (2tt) 3 



d 3 x 



Pa 



l + V 



(5) 



where the inte gral over position space defines the usual ef- 
fective volume |Tegmark|[l997l : lTegmark et al.ll 19981 ). Hence, 
the uncertainty in the amplitude of the power spectrum at 
the scale ki is given by o~ 2 (p') = a%./P? — -^Vkt Veff{ki). 



2.2 Multi-species Fisher matrix 

When several types of tracers are observed over the same 
volume cosmic variance should remain unaffected, but the 
nature of shot noise means that counts of one type of galaxy 
affect the counts of the other types in a Fisher information 
matrix. The optimal (minimal-variance) est imator in the 
case of multiple tracers was first obtained bv lPercival et all 



(2003), who considered a continuous distribution of galax- 
ies with luminosity-dependent bias. Those results were later 
used to write a Fisher matrix for surveys of m ultiple species 
d White et al.ll2008l: iMcDonald fc SelialfeoOSl ). 

In lAbramol fcOlj )' it was shown that the multi-tracer 
Fisher matrix also follows from the covariance of counts of 
galaxies, under the usual assumptions and approximations - 
e.g. , that shot no ise does not apply to the cross-correlations, 
sec ISmithl (|2009T ). The results of that paper are summarized 
below. 

Let's define the total effective power as the sum of the 
effective powers of all N t species of tracers in a survey: 



V(k-x) 



N t 

5> 



(6) 



The Fisher matrix for the parameters \ogP a (k;x) and 
log Vp (k; x) is then given by (Abramo 2012) 



F a p(k;x) 



y a v ^ v a v P {i-v) 
l + r 



(1 + P) 2 



(7) 



The Fish er matrix defined in Eq. can be obtained 
from that o f White et al.l ((2008) by a simple projection. 
IWhite et al] (|2008h do not assume any relationship between 
the cross-power spectra of two species, P a p , and their auto- 
power spectra. We, on the other hand, work under the as- 
sumption that bias stochasticity vanishes for the relevant 
scales, so that the cross-power spectra are implicitly given 
in terms of the auto-power spect ra through relat i ons su ch as 



_oi2_ 



PqqPg 



. - se e, however, ISwanson et al.1 <|2008D and 
2008) for the limitations of this approach. 



iBonoli fc Per] 

By applyi ng this restriction t o the Fisher matrix for pairs of 
tracers in IWhite et all |2008), we obtain the Fisher matrix 
of Eq. 0. 

In terms of a more usual set of parameters, 9 Z , we have: 
d 3 kd 3 x dlogP, 



F, 



= E 



(27T) 



d0< 



-F 



a/3 - 



d log Pi) 
dBi 



(8) 



■ ~ y ] FaP ; ij ■ 

a/3 

Notice that the Fisher information density is symmetric on 
the tracer species indices, F a p(k;x) = Fp a {k;x), but, from 
the definition above, Fc^-ij — Fp a .ji 7^ Fp a - t ij. 

Eq. ((U shows, as was already the case for Eq. (3). that 
we ought to call F a p the multi-tracer Fisher information 
density per unit of phase space volume. In the case of a 
single species of tracer Eq. ([4| tells us that the information 
density in phase space has an upper bound of i; for multiple 
tracers Eqs. (0 and ([8]) reveal that the information density 
is unbounded. 

In the case of two tracers, the Fisher matrix for the 
parameters \ogP\{k\x) and \ogp2(k;x) is given by 



F aB = 



Vl v , yf(i--P) 



(HP) 2 



V1V2O--V) 

(1+vy 2 



l + V (1 + P)2 



(9) 



Either from Eq. © or from Eq. ([9]) it can be seen that the 
individual elements of the multi-tracer Fisher matrix are 
unbounded, in contrast to the single-species Fisher matrix 
of Eq. Q. Nevertheless, as we will see next, cosmic variance 
is still manifested in the multi-tracer Fisher matrix. 
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We can readily obtain the FKP Fisher matrix for the 
power spectrum from Eq. ((8]) by noting that, if the 6 l axe 
limited to parameters of the power spectrum, p ! , which 
are shared equally by all tracers, then dlogVa/dO 1 — > 
d\ogP/dp r . In that case, the sum over tracers in Eq. 1(8} 
can be brought inside the integral, resulting in: 



Fisher matrix are defined as follows: 



F T 



a/3 



V 



2 \1 + P 



1 

<2 



(10) 



This then leads to the familiar expression for the Fisher 
matrix, shown in Eq. (5). provided we make the natural 
identification V a — > V = ^2 a V a - This result means that, 
when measuring P(k,z), we are always limited by cosmic 
variance (through the effective volume), and shot noise is 
determined by the sum of the effective powers. Under the 
usual assumptions, this total effective power reduces to V — > 

iSaMk+ZA**) 3 ] P(k,z). 

However, even though the Fisher information density 
for the power spectrum has an upper limit, Ft < §, the 
individual components of the Fisher matrix density, F a p, 
are not bounded. In fact, this upper limit is only relevant in 
the case of parameters for which the derivatives d log Va/dO 1 
are independent of the tracer species a. This is the case, e.g., 
for the power spectrum and for the matter growth function 
G(z), but is not the case for the biases of the tracers, for the 
RSD parameter, or for any other bias-sensitive parameter 
such as Jnl- 

In particular, this means that the Fisher information 
density for some parameters can be much larger than the 
nai've ^ bound. Indeed, inspection of Eq. Q shows that the 
diagonal components of F Q( 3 can be arbitrarily large when 
some of the effective powers V a 3> 1. The cross-terms (a 7^ 
ft), on the other hand, can be very large and negative, but 
this is precisely what is necessary in order to preserve the 
constraint of Eq. (fT0|) . 



3 DIAGONALIZED MULTI-SPECIES FISHER 
MATRIX 

Even if some individual components of the multi-tracer 
Fisher matrix F a p are arbitrarily large, this does not nec- 
essarily mean that the Fisher matrix for the parameters, 
shown in Eq. (|8]), will inherit these enhancements after 
summing over all the tracers and accounting for the cross- 
correlations. We will now show that this is in fact the case, 
by diagonalizing the multi-tracer Fisher matrix. 

The parameters of the multi-tracer Fisher matrix in Eq. 
(0 are the variables \ogV a {k\ x) (a = 1, . . . , Nt). Let's de- 
fine, in a manner similar to Eq. (|6]), the partial sums: 



(11) 



where a = l,...,Nt, and by this definition Si = V and 
5jv t = VNf The variables S a can be regarded as the aggre- 
gate effective powers. The order in which the effective powers 
V a are organized in these sums is irrelevant. 

The set of variables which diagonalizes the multi-tracer 



yi = 

34 = 

y a = 

y Nt = 



Si -- 

Vi 

& 

Ta- 



V 



(12) 



S a 



7>N t - 



(13) 



(14) 



Sjv t 74v t 

Hence, the new variable 34 is simply the total effective power 
of the survey, while the other variables y a (a 7^ 1) are rela- 
tive effective powers. These relative powers are ratios of the 
individual effective powers to the aggregate effective pow- 
ers S a - While any ordering of the effective powers can be 
used for defining the variables S a and y a , it may be more 
intuitive to organize the tracers in order of the volume that 
they cover in the survey, so that the tracer with the least 
volume would correspond to 74 and the tracer covering the 
most volume would correspond to Pjv t . However, one may 
also organize the effective powers as a function of bias (or 
halo mass) . In either case, the Fisher matrix always remains 
well-behaved, even if some of the V a happen to vanish. 

Changing variables from log V a {k; x) to log 34 (fc; a;) 
leads to a completely diagonal Fisher matrix: 



F a . 



^d 



dlogP Q dlogVp 





logy a 



diogy b 



— 6 ao T a 



The elements of this diagonal Fisher matrix are: 



To 



V 



i + r 

1 V S a Va-l 



= F T 



(a/1) 



(15) 

(16) 
(17) 



4 1+P Sa-l 

Therefore, the variables log 34, with the definitions of 
Eqs. (|12|l - (113l ). are the eigenvectors of the multi-tracer Fisher 
matrix, and its eigenvalues are the T a of Eqs. (|16[) - (|17[ ). 

It follows that the variable 34 = V (the total effective 
power of the survey) is completely independent from the re- 
lational variables 34 (a / 1), and, by Eq. (|16ll . its informa- 
tion density is strictly limited by the constraint < T\ < i. 
The variables J (a 7= 1) are also independent of each other, 
but from Eq. (117[) the information densities associated with 
them can assume any (positive) value. Since the multi-tracer 
Fisher matrix is diagonal, the uncertainties in log 34 are 
given by the square roots of the inverses of Eqs. (|16|l - (|17l ). 

Any linear combination of the variables log 34 con- 
structed through the action of an ^-dimensional orthogonal 
transformation would still lead to a diagonal Fisher matrix. 
It can be verified that permutations of the effective powers 
V a generate 0(Nt — 1) orthogonal transformations between 
the variables log 34 (a 7^ 1) which are equivalent to the cor- 
responding redefinitions according to Eqs. II 1211 - II 131 ). 

As a concrete example, take a survey of two species 
of tracers. In that case, we have 34 = V\ + V2 = V, and 
34 = VxjVi- The Fisher matrix element associated with 
log 34 is T\ = \ V 2 /(I + V)' 2 , and the Fisher matrix for 
log 34 is T% = JV1P2IO- + V). Exchanging V\ and Vi leaves 
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Pi 

Figure 1. Fisher matrix elements in the case of two species of 
tracers. The Fisher information density T\ of Eq. I |16jl . which is 
associated with the total effective spectrum y± = V = "Pi + V2 , 
is shown by the dashed lines for various values of V2 (0.5, 1, 2, 
4, and 8, from the bottom up). The Fisher information density 
J-2 = / (\ + V\ + V2), associated with the relative power 

3^2 ='Pi/T > 2, is shown by the solid lines. 



log yi invariant, introduces an irrelevant change in the sign 
of log 3^2 — t — log 3^2, and leaves both J-\ and T% invariant. 

The behavior of the two independent components of the 
Fisher information density for the two-tracer case are plot- 
ted in Fig. [T] For small values of the effective powers V\ 
and V2, it is T\ which has the largest information density. 
However, for large values of the effective power (that is, for 
large enough densities of the tracers), it is T2 which carries 
the most information density. In the limit Pi > 1 we see 
that T\ — ► |, while Ti — > 7V2; hence, when both Pi > 1 
and V2 S> 1, we have Ji«j, but F2 ^> 1. If Vi or V2 van- 
ishes in some region of space, this region will not contribute 
with any information about their ratio (34), although it does 
contribute to the usual Fisher matrix density (the one as- 
sociated with 3T = V). When both tracers vanish in some 
region of space, then the whole Fisher matrix also vanishes 
identically. 

This diagonal form of the Fisher matrix has an addi- 
tional advantage: it reduces the amount of computations 
needed for practical applications. Instead of the Nt (Nt — 1)/2 
sums and integrations in Eq. j5). we only need to compute 
Nt terms: 



Nt 

= E 

a = l 



d 3 kd 3 x d log 34 _ dlog^, 



(27T) 



(18) 



This means if it ever becomes possible (and desirable) to 
divide the tracers in a survey into 100 different types, we 
only need to compute the 100 terms in Eq. (|18p . instead of 
the ~ 5. x 10 3 terms needed for the non-diagonal form of 
the Fisher matrix. 



4 APPLICATIONS TO FUTURE SURVEYS 

As an application of these results, we study how the relative 
clusterings improve cosmological constraints for a hypothet- 
ical redshift survey that can detect three types of tracers 
of large-scale structure. These tracers were chosen to repro- 
duce, as much as possible, the properties of luminous red 



ft) 
a; 



2.0 
z 



Figure 2. Effective powers Vb. (LRG-like, red in color version), 
V E (ELG-like, yellow), and Vq (QSO-like, blue), evaluated at 
k = 0.1 /i Mpc" 1 , and across the line-of-sight (/ij. = 0). 



galaxies (LRGs), emission- line galaxies (ELGs) and quasars 
or AGNs (QSOs). The LRG-like tracers are relatively rare, 
have a somewhat high bias, and are shallow [z < 1.5). The 
ELG-like tracers are more abundant, have a relatively low 
bias, and can be detected to higher redshifts compared to 
LRGs (z < 2). The QSO-like tracers are very rare, have a 
very high bias, an d can be detected to very high redshifts 
(z < 4) - see, e.g. jAbramo et all l|2012i ). 

Fig. [2] shows the effective powers for each species of 
tracer, computed at the typical scale of k — 0.1 h Mpc -1 , 
for modes perpendicular to the line-of-sight (/i*, = 0). Since 
the effective power is a measure of shot noise (a high value 
of V a indicates very low shot noise), the effective powers 
chosen for Fig. f2] cover several different scenarios that one 
may encounter in real surveys. 

We have assumed that the survey covers 10 4 deg 2 , 
which, for the number densities we have considered, imply 
total numbers of 2 x 10 7 for the LRG-like tracers, 5 x 10 7 for 
the ELG-like tracers, and 3 x 10 6 for the QSO-like trac- 
ers. We also assumed that the redshifts are accurate to 
cr z = 0.001(1 + z), and these uncertainties were factored 
into the Fisher matrix in the usual way, through a factor 
exp[— k 2 fj, 2 . a 2 c 2 H~ 2 ] which multiplies the Fisher informa- 
tion density. We have also cut-off the Fourier-space integra- 
tions at k — 0.1 h Mpc -1 , in order to avoid contributions 
from scales where non-linear effects become essential. 

We employ the same method that has been extensively 
used to make forecasts using the po wer spectrum and bary- 
onic acoust ic oscillations (BAOs) |Seo fc E iscnstcin 120031 ; 
IWanell2006h . We assume that the observed power spectrum 
for a given tracer can be expressed in terms of a "reference" 
model as: 



P obs (z;k ref ,n r k ef ) 



D A (z) 



(19) 



where: 



P l (z;k,^) = G 2 (z) [b(z) + f(z) 4] P,(0;fe) 



(20) 



In this last equation, Pi(0,k) denotes the linear theory, po- 
sition space power spectrum at z = 0. In Eq. (|19|l the 
wavenumbers in the fiducial model are related to those in 
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another, arbitrary cosmology, by 
H ref (z) 



H{z) 



k7 f = kj 



D A (z]_ 

*) 



(21) 



Changes in the properties of k and \i k due to changes in 
the cosmological model follow from their definitions, k = 

For simplicity, we will follow 



fen + k\, and fj, k 



kljk 2 . 



IWand | 20061 ) and absorb the normalization of the power 
spectrum, as well as the matter growth function, inside the 
prefactor of Eq. (|20[) . This leads to the following spectrum 
for the species a: 

P a (z-k,f, k )=[s a (z) + f s (z) f i 2 k ] 2 , (22) 



where the effective bias for the tracer a is defined as: 

s a = b a x <7 8 G(z) , (23) 

and the effective RSD parameter is given by: 

f. = fx a s G(z) . (24) 

We will employ the subscripts R, E and Q to refer to 
the LRG-like tracer, the ELG-like tracer, and the QSO-like 
tracer respectively. 

Our fiducial model is a standard, flat ACDM model with 
fi m = 0.27, h = 0.7, n s = 0.96, fi„ = and f NL = 0. In 
this model, the RSD parameter is very well approximated 
by /(*) = -dlogG(z)/dz ~ n° m 55 (z). 

Our set of parameters is the following: 

9 1 = {log H{z),\og D A (z), s r (z), s e (z), sq(z), f B (z), f N L(z)} 

(25) 

In this paper we will discuss only the conditional errors 
(i.e. the reciprocals of the Fisher matrix elements), hence 
we have not included any global cosmological parameters 
such as fi m , h, or n 3 in our analysis. It only makes sense to 
include global parameters, and their covariance with the pa- 
rameters on each redshift slice, when we also consider priors. 
However, this would obfuscate the contribution that arises 
from the relative clusterings between the tracers, so we leave 
this important issue to a future paper. 

In Figs. [3] and [4] we show the Fisher information for 
log H (z) and log Da (z) on each redshift slice. These Fisher 
matrix elements are related to the conditional errors by 
F(log H, log if) = H 2 /a 2 (H), and similarly for log Da (<t c 
are the uncertainties assuming that all other parameters are 
kept fixed). The information in the first few redshift slices is 
smaller simply because the volume of the survey grows very 
fast with redshift. 

In Figs. 1 3191 the thin short-dashed lines correspond to 
the Fisher information obtained by using only the LRG-like 
tracer (red in the color version), only the ELG-like tracer 
(green), or only the QSO-like tracer (blue). The thick solid 
line shows the Fisher information obtained by using all the 
tracers, as well as their cross-correlations; it is always above 
the Fisher information from the individual tracers. The thick 
long-dashed line (grey in color version) corresponds to the 
Fisher information which comes from the total effective vol- 
ume of Eq. (116[) . and is therefore the quantity which is sub- 
ject to cosmic variance. The long-dashed lines (dark and 
light purple in color version) correspond to the information 
in the relative clusterings between the three species of trac- 
ers, and are not subject to the same bounds as the effective 
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Figure 3. Fisher matrix element F(log H, log H) = H 2 /a 2 (H), 
where o c (H) are the conditional errors on H. The thick solid 
(black) line corresponds to the total Fisher matrix, including in- 
formation from all of the tracers as well as their cross-correlations. 
The thin short-dashed lines correspond to the Fisher informa- 
tion obtained by considering only the individual tracers (in the 
color version, red, green and blue correspond to LRGs, ELGs and 
QSOs, respectively). The thick long-dashed line (gray in color 
version) corresponds to the Fisher information associated with 
the total effective volume of Eq. H16I I. and is subjected to the 
limitations imposed by cosmic variance. The long-dashed lines 
(dark and light purple in color version) correspond to the the 
contributions to the Fisher matrix which come from the relative 
clusterings between the different tracers, shown in Eq. 11171 . The 
inset shows the contribution of the relational information to the 
total information on each redshift slice. 




Figure 4. Fisher matrix element F(log Da, log Da) = 
D 2 A j o 2 (D a)- The lines and the inset follow the same prescrip- 
tion as described in the caption for Fig. \3\ 



volume - see Eq. (|17[) . The sum of those three contributions 
(the thick long-dashed lines) is exactly equal to the total 
Fisher information (denoted by the thick solid line). 

The insets (upper right corners) in Figs. [3][9] show the 
fraction of the Fisher information contributed by the rela- 
tive clusterings between the tracers. As one can see, there 
is a ~ 30% enhancement in the information of H(z), and 
a ~ 20% enhancement in the information of Da(z), which 
comes from the information in the relative amplitudes of 
the clusterings. These enhancementes can be understood as 
follows: although the prefactor of Eq. (|19|) contains most 
of the information about H{z) and Da(z), that coefficient 
cancels out from the ratios between the effective powers, 
and therefore does not contribute anything to the Fisher in- 
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Figure 5. Fisher matrix element F(sr,sii) = s'r/^c^r) f° r tne 
effective bias of the LRG-like tracer, sr = bn X agG(z). 



Figure 7. Fisher matrix element F(sq, sq) = Sq/ct^(sq) for the 
effective bias of the QSO-like tracer, sq = 6q x <jgG(z). 



l.Ox 10" 
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Figure 6. Fisher matrix element F(se, sg) = s e /cf 2 (se) for the 
effective bias of the ELG-like tracer, sg = &g X (TgG(z). 



Figure 8. Fisher matrix element for the effective redshift distor- 
tion parameter, / s (z) = f(z) ag G(z). 



formation of the relative clusterings. However, the observed 
spectra also depend on H(z) and Da{z) through the ra- 
dial and angular components of the Fourier modes [see Eq. 
(|21[) ]. and that leads to some amount of information about 
BAOs in the relative clusterings. In fact, by increasing the 
redshift errors to a z > 0.01(1 + z) we lose the ability to mea- 
sure anisotropics in the clustering, and as a consequence the 
Fisher information from the relative clusterings to H(z) and 
Da(z) becomes negligible. 

In the case of the biases of the tracers, there is a dra- 
matic improvement coming from the relative clusterings. In 
Figs. [5] [6] and [7] we show the Fisher information for the ef- 
fective biases of the LRG-, ELG-, and QSO-like tracers, re- 
spectively. For the biases, the information arising from the 
relative clusterings is almost always dominant with respect 
to the information which comes from the effective volume. 
Moreover, including other species of tracers with different 
biases improves the information in the original bias by an 
order of magnitude, compared with the single-tracer anal- 
ysis. There is a caveat, though: including other species of 
tracers also introduces cross-correlations between the mea- 
surements of the biases; so the conditional errors implied by 
the information shown in Figs. 0[7| are substantially under- 
estimated with respect to the actual, marginalized uncer- 
tainties in those biases. 

The most significant results are those for the effective 
RSD parameter, f s (z) — f(z) as G(z), and for the local non- 
Gaussian parameter, /atl- We model the non-Gaussianities 



in the usual way (|Palal e t al. 2008;|), through a bias correc- 
tion: 



Ab NL =f NL (b-l) 



3 S c O m Hq 
c 2 k 2 T{k)G{z) 



(26) 



where S c — 1.68 is the critical linear density for spherical 
collapse, and T(k) is the matter transfer function normalized 
to unity at large scales (fc = 0). This may not be a good 
approximation for quasars l|Slosar et al. 2008), for which the 
factor (b — 1) in Eq. (126 p should perhaps be substituted by 
(b — 1.6), but we have kept the original definition for the 
sake of comparison between tracers of different biases. 

In Fig. [8] we show the Fisher information for f s on each 
redshift slice, and it is clear that the information from rela- 
tive clusterings can enhance the information in the effective 
volume by up to a factor of ~ 10. The largest improvement 
is precisely at the lowest redshift slices, which are also the 
ones which are most sensitive to dark energy and modified 
gravity models. This figure shows that, in order to measure 
the RSD parameter with high accuracy (both as a function 
and as a function of scale), we need to observe many more 
tracers at redshifts z < 1 than a simple argument based on 
shot noi se would indicate. Th ese results confirm and extend 
those of lHamaus et al] (|2012l '). 

The Fisher information for /nl is shown in Fig- EI There 
is a dramatic enhancement in the information at low red- 
shifts, with respect to what we expect solely from the ef- 
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Figure 9. Fisher matrix element for local non-Gaussianity pa- 
rameter /jvi- 

fective volume. In fact, the improvement is so large that 
it may seem at odds with the fact that non-Gaussianities 
are manifested through bias-dependent modifications to the 
transfer function on large scales. However, for the scenario 
that we considered in this Section, there is so much more 
information density in the low redshift slices (where there 
is a high density of tracers), which come from comparing 
the clusterings between the different tracers, that this more 
than compensates for the relatively small scales spanned by 
those low-2 slices. The bottom line is that it is possible to 
enhance the constraints on /jv l by large facto rs. This is also 
consistent with the findings of lHamaus et all (|201ll ). 



5 CONCLUSIONS 

We have shown how galaxy surveys can constrain bias- 
sensitive parameters to an accuracy which is not limited 
by cosmic variance. This emerges naturally from the multi- 
tracer Fisher information matrix, whose eigenvalues include 
both the effective volume (which is limited by cosmic vari- 
ance) and the information from comparing the relative am- 
plitudes of clustering between tracers. It is trivial to employ 
the methods described in this paper to forecast the power 
of any survey, and to any number of tracers of large-scale 
structure. The task of producing forecasts is further simpli- 
fied due to the diagonalized form of the multi-tracer Fisher 
matrix. 

In order to promote clarity we made several simplifying, 
but probably over-optimistic assumptions: e.g., we took the 
shot noise of auto-correlations to be Poissonian, but assumed 
that there is no shot noise for the cross-correlations between 
different tracers. We also assumed that the different trac- 
ers of large-scale structure correspond to non-overlapping 
halo mas ses, so that the b i ases a r e not correlated - and, a s 
shown bv lGil-Marin et al.l < |2010h ; ICai fc Bernstein! (|2011aT ). 
these correlations limit the benefits of the multi-tracer ap- 
proach. In the context of the formalism presented in this 
paper, we can go beyond these simple assumptions by con- 
sidering a more realistic covariance for the counts in cells, 
and then computing the corresponding multi-tracer Fis her 
information matrix, as was carried out in lAbramoll|2012l ). A 
particularly useful extension would be to compute the opti- 
mal binning for the bias in terms of the distributions of halo 



masses, including the cross-correlations between the biases 
of the different tracers. 

Moreover, because priors and marginalizations tend to 
obfuscate the different contributions that determine each 
constraint, we have decided to focus on the Fisher infor- 
mation itself, which is related to the conditional errors by 
a c (9 % ) — 1/ V Fa . However, it is clear that considering several 
different types of tracers also brings additional covariances, 
so the actual constraints (after all the marginalizations and 
priors) will not be enhanced by the same factor. A crucial 
next step to our analysis would be to consider the marginal- 
ized constraints subject to some suitable sets of priors. I n 
particular, as already shown by ICai fc Bernstein! (|2011bl ). 
the role of lensing as a means to determine priors on the 
biases of different tracers of large-scale structure seems a 
promising way to mitigate some of the covariances inherent 
to multi-tracer surveys. 

Our analysis shows that the parameters whose con- 
straints have the most to gain from a multi-tracer survey 
are the RSD parameter, f(z), and the local non-Gaussianity 
parameter, /jvl - For the example employed in Section^ the 
Fisher information of the effective RSD parameter increases 
by a factor of up to ~ 10 when the information from rela- 
tive clustering is included, which means a factor up to ~ 3 
reduction in the conditional errors for that parameter. For 
/jvl, the Fisher information is boosted by an even more 
dramatic factor, of ~ 20. For the hypothetical survey and 
tracers that we considered, the largest gains would occur 
at low redshifts, because of the higher number densities of 
objects in the low-z slices. 

Our analysis also shows that it is possible to obtain 
enhancements in the determination of radial and angular 
BAOs from the relative clusterings: in the scenario we con- 
sidered, these enhancements were up to ~ 20% for the Fisher 
information of the angular distance Da(z), and up to ~ 30% 
for the Fisher information of H(z). The explanation for these 
enhancements comes from the fact that radial and angular 
BAOs are also manifested in the anisotropics of the cluster- 
ing, through fik - see Eq. (1211). This is in fact a m ulti-tracer 
Alcock-Paczynski test |Alcock fc Paczvnskilll979! ). hence it 
depends on whether or not the redshift accuracy of the sur- 
vey allows the measurement of anisotropies in the redshift- 
space clustering. Indeed, we have checked that for redshift 
errors a z > 0.01(1 + z) there is basically no additional con- 
tribution from the relative clusterings to the Fisher infor- 
mations of Da(z) and H(z). 

The key point in this paper is that whenever a red- 
shift survey detects tracers of large-scale structure whose 
clustering amplitudes are different, comparisons between the 
clusterings tap into additional sources of information about 
the parameters that describe them. It may be useful to 
think of a "generalized bias" for each tracer, of the form 
B a (z;k, Hk) = b a (z;k) + fa(z; + ... , which includes 
as many parameters and dependencies as one may wish to 
consider. As long as there are sufficiently high densities of 
tracers with distinct (i.e., non-degenerate) biases, we can 
measure B a (z; k,^^) in bins of z, k and fik to an accuracy 
which is, in principle, not limited by the volume of the sur- 
vey. 
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